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Abstract 



Many important problems in discrete optimization require maximization of a monotonic submodular function subject to matroid 
constraints. For these problems, a simple greedy algorithm is guaranteed to obtain near-optimal solutions. In this article, we extend 
this classic result to a general class of adaptive optimization problems under partial observability, where each choice can depend 
on observations resulting from past choices. Specifically, we prove that a natural adaptive greedy algorithm provides a + 1) 
approximation for the problem of maximizing an adaptive monotone submodular function subject to p matroid constraints, and more 
generally over arbitrary p-independence systems. We illustrate the usefulness of our result on a complex adaptive match-making 



£S| application. 
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1. Introduction 

Submodular functions play an important role in discrete opti- 
mization. Many important problems, such as facility location 1 1 1, 
coverage [2], influence maximization J3|, and experimental de- 
sign [4 1 can be reduced to constrained maximization of a sub- 
modular set function. Submodularity, informally, is an intuitive 
notion of diminishing returns, which states that adding an ele- 
ment to a small set helps more than adding that same element to 
a larger (super-)set. While maximizing submodular functions in 
general is NP-hard, a celebrated result of Nemhauser et al. Q 
shows that a simple greedy algorithm is guaranteed to find a 
near-optimal solution to the problem of maximizing a submodu- 
lar function subject to cardinality constraints; the value of the 
greedy solution obtains at least a constant fraction of (1 — 1/e) 
of the optimal value. Nemhauser and Wolsey [5 | furthermore 
show that, in general, obtaining better approximation guarantees 
requires evaluating the objective function on an exponential num- 
ber of sets. Beyond cardinality constraints, the greedy algorithm 
also obtains guarantees for maximizing a monotone submodular 
function subject to p matroid constraints J6), and, more gen- 
erally, p-independence systems Q. In both cases, the greedy 
algorithm achieves at least a constant fraction of l/(p + 1) of 
the optimal value. 

Submodular optimization provides a unified framework for 
many important non-adaptive optimization problems. In many 
practical optimization problems, however, one needs to adop- 
tively make a sequence of decisions, taking into account observa- 
tions about the outcomes of past decisions. Often these outcomes 
are uncertain, and one may only know a probability distribution 
over them. Finding optimal policies for decision making in 
such partially observable stochastic optimization problems is 
notoriously intractable (see, e.g., [8|). The classical notion of 
submodular set functions does not allow one to handle such 



adaptive optimization problems. In recent work J9] HP! . we 
introduced the concept of adaptive submodularity, a natural 
generalization of submodular set functions to adaptive policies, 
and prove that if a partially observable stochastic optimization 
problem satisfies this property, a simple adaptive greedy algo- 
rithm is guaranteed to obtain near-optimal solutions in the case 
of cardinality constraints. The concept has several useful appli- 
cations, including active learning, machine diagnosis, adaptive 
viral marketing, and sensor placement JTOjQT). 

In this article, we prove that the adaptive greedy algorithm is 
guaranteed to obtain a l/(p + 1) -approximation for the more 
general problem of maximizing an adaptive submodular func- 
tion over p-independence systems (and therefore subject to p 
matroid constraints). Our results generalize those obtained by 
Asadpour et al. 1 12] for optimizing a particular instance of adap- 
tive submodular functions to a single matroid constraint to arbi- 
trary adaptive monotone submodular functions and to arbitrary 
p-independence systems. We illustrate the usefulness of this 
result on adaptive match-making problems such as online dat- 
ing. We show that the constraint system in this application is a 
2-independence system, and thus the adaptive greedy algorithm 
provides a 1 /3-approximation. 

2. Background 

We first review background on submodular optimization, as 
well as the recently discovered adaptive submodularity property. 

2.1. Submodular optimization over p-independence systems 

Let E be a finite set of items that we may select among. 
Consider an objective function / : 2 E — > K. For each e £ E let 
A(e | A) :— f{A U {e}) — f{A) denote the marginal benefit 
of item e w.r.t. set A. We call / monotone iff A(e | ^4) > 
for all e € E and A C E. We call / submodular, iff for all 



Ac B £E and e£E\B it holds that A(e | A) > A(e | B), 
i.e., adding e to a set A provides larger marginal benefit than 
adding it to set B. Many non-adaptive problems in discrete 
optimization can be reduced to finding a set A that maximizes a 
monotone submodular function / subject to some constraints. 

One natural class of constraints that we wish to consider 
are matroid constraints, which require the solution to be an 
independent set of a matroid: 

Definition 1 (Matroid). A matroid is a pair (E,l) with a 
nonempty collection 1 C 2 E of independent sets such that 

• For all A C B C E, if B £ 1 then A el. 

• For all A,B el with \B\ > \ A\, there exists e £ B\A 
such that Au{e} el 

Matroids were developed as an abstraction of families of 
independent objects, such as sets of linearly independent vectors 
in a vector space. One important example is the uniform matroid, 
where sets are independent iff they contain at most k items for 
some fixed k, i.e., we wish to pick a set A that maximizes / 
over all sets of size at most k. Other important examples include 
partition matroids and graphical matroids. 

A yet more general class of constraints are p- independence 
system constraints, which require the solution to be an indepen- 
dent set of a p-independence system: 

Definition 2 (p-independence system). For a parameter p £ 
N, a p-independence system is a pair (E, 1) with a nonempty 
collection 1 C 2 E of independent sets such that 

• For all A C B C E, if B £ 1 then A el. 

• For C C E, let B (C) be the set of maximal 
elements of 1 which are subsets of C. That is, 
B{C) := {B C C : Ve e C \ B, B U {e} $ 1}. Then 
for all nonempty C C E, we have maxggg(c) \B\ < 
p-min BeB{c) \B\. 

One important special case of a p-independence systems is the in- 
tersection of p matroids: (E, C\ p i=1 li) where (E, 1A is a matroid 
for all i. Calinescu et al. [7] discuss other special cases, such as 
p-circuit-bounded families considered in lfl3l and p-extendible 
families defined in [14]. 

While maximizing submodular functions is hard even for a 
single uniform matroid constraint, a celebrated result of Fisher et 
al. |6) is that a simple greedy algorithm obtains a l/(p + 1) ap- 
proximation for the problem of maximizing a monotone submod- 
ular function subject to the intersection of p matroids. Hereby, 
the greedy algorithm (under constraint 1) starts with the empty 
set A = 0, and iteratively adds an item 

e* £ argmax A(e | A). (1) 

e : AU{e}<EX 

to A until A is a maximal set in 1. 

In some applications, implementing the greedy rule ([T]l may 
by itself require solving an NP-hard optimization problem (e.g., 
selecting tours for information-gathering robots 03)). Fortu- 
nately, the proof given in |6| can be extended to show that 
for any a £ [0, 1] an a-approximate greedy algorithm obtains 
an approximation guarantee of -2L_ . Here, an a-approximate 



greedy algorithm is one that selects an a-approximation to the 
best greedy element in each step, i.e., an item e* such that 
AU {e*} £ 1 and A(e | A) > a ■ max e . Au{e}eI A(e | A), 
until A is maximal in 1. Furthermore, this result holds also for 
the more general case of a p-independence system constraint. 
Extending the proof to an a-approximate greedy algorithm was 
done by Goundan and Schulz in the case of the intersection of 
p matroids [16], and by Calinescu et al. for the more general 
p-independence system constraint 0. 

2.2. Adaptive optimization 

We are interested in adaptive optimization problems, where 
our choice of items can depend on observations about items 
chosen previously. We will now formalize the class of adaptive 
optimization problems that we consider. 

Each item e £ E is in a particular (initially unknown) state 
$(e) £ O from a set O of possible states. Hereby, <£> : E — > O is 
a (random) realization of the ground set, indicating which state 
each item is in. We take a Bayesian approach and assume that 
there is a (known) probability distribution P [$] over realizations. 
We will consider problems where we sequentially pick an item 
e £ E, get to see its state $(e), pick the next item, get to see 
its state, and so on. After each pick, our observations so far can 
be represented as a partial realization fy, a function from some 
subset of E (i.e., the set of items that we already picked) to their 
states. For notational convenience, we sometimes represent \fr 
as a relation, so that f C Ji x O equals {(e, o) : ^(e) = o}. 
We use the notation dom^) = {e : 3o.(e, o) £ to refer to 
the domain of \& (i.e., the set of items observed in ty). A partial 
realization W is consistent with a realization $ if they are equal 
everywhere in the domain of In this case we write $ ~ ^. 
If and are both consistent with some <i>, and dom(^) C 
dom(W), we say ^ is a subrealization of Equivalently, 
is a subrealization of if and only if C 

We encode our adaptive strategy for picking items as a policy 
7r, which is a function from a set of partial realizations to E, 
specifying which item to pick next under a particular set of 
observations. If ^ <^ dom(7r), the policy terminates (stops 
picking items) upon observation of "f. Technically, we require 
that the domain of tt is closed under subrealizations. That is, if 
£ dom(7r) and \E" is a subrealization of ^' then ^> £ dom(7r). 
We use the notation E(n, <E>) to refer to the set of items selected 
by tt under realization $. 

We wish to maximize, subject to some constraints, a utility 
function / : 2 E x O e — > M>o that depends on which items we 
pick and which state each item is in. Based on this notation, the 
expected utility of a policy tt is / aV g(7r) := E [f(E(n, $),$)] 
where the expectation is taken with respect to P [$] . 

In this paper, we address the problem of optimization subject 
to a p-independence system. Thus, our goal is to find a policy 
tt* such that 

tt* £ arg max / av g(7r) subject to E(tt, $) € 1 for all (2) 

7T 

where I is a collection of independent sets defined by a p- 
independence system. 
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2.3. Adaptive submodularity 

In general, Problem (2) cannot be approximated to within 
a factor of C(|i?| 1_e ) for any constant e > under the rea- 
sonable complexity-theoretic assumption that PH ^ Ef [ 10 1, 
even if / is modular in its first argument for all realizations, i.e., 
even if there exist w e > such that f(A, 4>) = 2~2 e eA w e,<s>- 
However, there is a natural adaptive generalization of monotone 
submodular functions, such that many classic results on the opti- 
mization of monotone submodular functions generalize to the 
adaptive realm [ 10 1. These adaptive generalizations are defined 
in terms of the conditional expected marginal benefits of items, 
that is, the expected marginal benefit of an item, conditioned on 
the current partial realization. 

Definition 3 (Conditional Expected Marginal Benefit). 

Given a partial realization and an item e, the conditional 
expected marginal benefit of e conditioned on having observed 
*, denoted A (e | is 

A(e | *) := E [/(dom(*)U{e} , $) - /(dom(W), $)|$ - *] 

(3) 

where the expectation is taken with respect to P [<&]. 

We are now ready to introduce our generalizations of monotonic- 
ity and submodularity to the adaptive setting: 

Definition 4 (Adaptive Monotonicity 1 10 1). A function f 
2 E x E — > ]R>o is adaptive monotone with respect to dis- 
tribution P [$] ;/ the conditional expected marginal benefit of 
any item is nonnegative, i.e., for all ^ with P [*] > and all 
e G E we have 

A(e|*)>0. (4) 

Definition 5 (Adaptive Submodularity 1 10 1). A function f : 
2 E x O e — > M>o is adaptive submodular with respect to distri- 
bution P [$] if the conditional expected marginal benefit of any 
fixed item does not increase as more items are selected and their 
states are observed. Formally, f is adaptive submodular w.r.t. 
P [$] if for all * and *' such that * is a subrealization of *' 
(i.e., ^ C and for all e G E, we have 

A (el*') < A(e|*). (5) 

Note the similarity with the submodularity condition, which 
we wrote as A(e | B) < A(e | A) for all A C B C E and 
e G E\B specifically to highlight the connection. When P [•] is 
deterministic, so that only one realization can occur, the adaptive 
monotonicity and adaptive submodularity reduce to the classic 
monotonicity and submodularity conditions. 

The power of these definitions is that certain results for mono- 
tone submodular maximization generalize from (non-adaptive) 
sets to (adaptive) policies [ 10 1. In this paper, we will show that 
this is true even for the general case of maximizing a monotone 
submodular function under an arbitrary p-independence system 
constraint. 



3. Adaptive Optimization Over p-Independence Systems 

In this section, we analyze a natural adaptive generalization 
of the greedy algorithm (;T) and its a-approximate versions to 
the adaptive realm. We call our adaptive generalization of an 
a-approximate algorithm an a-approximate greedy policy. 

Definition 6 (a-approximate greedy policy, for constraint X). 

A policy tt is an a-approximate greedy policj^J for constraint 
X if, upon observing partial realization *, tt selects an item e 
such that dom(*) U {e} £lf if such an e exists) and such that 

A(e I #)>£*•( max A(e' I *) ) . (6) 

\e':dom(*)u{e'}eI / 

Otherwise, if no such an item e exists, tt terminates. 

We are now ready to prove our main result: The adaptive 
generalization of the non-adaptive approximation guar- 
antee of an a-approximate greedy algorithm in the case of a 
p-independence system constraint. 

Theorem 7. Fix an adaptive monotone submodular function 
f : 2 E x O e — > K>o and a p-independence system (E,I). Fix 
a policy tt which is a-approximate greedy with respect to f for 
constraint X. Then tt yields an — ^ approximation, meaning 

\p-\-aJ feasible tt* 

where tt* is feasible iff E{w* , G I for all 

Proof. Our proof is an adaptive generalization of the elegant 
proof of Calinescu et al. \7\ for the non-adaptive case, which 
itself is based on a scheme of Jenkyns [ HI . We will not repro- 
duce that proof here, but instead point out the features of it that 
we need. Consider the non-adaptive case, and fix any maximal, 
feasible set of elements S = {ei, C2, ■ ■ ■ , e&}. Formally, S G I 
and S U {e} ^ 1 for all e € E \ S. Let Si := {e 1 ,e 2 , . . . , ej 
for all i. Let S* G 1 be any feasible set. As a key step in 
their analysis, Calinescu et al. prove the existence of a sequence 
of sets Si, S%, . . . , SI whose nonempty members partition S*, 
such that for all i and e G S*, Sj-i U {e} G X, and \S*\ <p 
for all i. We will call SI , . . . , S£ a decomposition of S* with 
respect to the sequence (ei, e2, . . . , efc). 

Now consider the adaptive case. Fix a-approximate policy tt 
and any feasible policy tt* . Let *j $ be the partial realization 
observed by tt after selecting min{i, k(Q)} elements under 
true realization $, where fc($) := \E(tt, $)|. We consider the 
decomposition of E(tt* , $) with respect to the sequence of ele- 
ments selected by tt under $, in the order they are selected by tt. 
Let E* ($), . . . , ($) be such a decomposition. Note that 
each e G E*. is feasible when the algorithm has observed 
^i.^, in the sense that dom(*i $) U {e} G X. Hence, any a- 
approximate policy tt must obtain conditional marginal expected 



'in 1101 . a-approximate greedy policies are defined for a > 1, whereas 
here we define them for a < 1 (as the inverse of the factor in 1 10 1) to maintain 
uniformity with previous work on matroid constraints and their generalizations. 
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benefitA(7r(* i)$ )|* i)# ) > a-A(e | for all e e £* +1 ($). 
Since |2?*, x (<£>) | < p, as all parts of the decomposition promised 
by Calinescu et al. must be, we have 



A (ttO^,*)!^) > - A ( e l**,*) 



(7) 



Note we can write / avg (7r) in the form 



/avg(7T) = E 1 
*:*Gdom(7r) 



A(tt(*)|*) 



(8) 



where $ is the true realization. Intuitively, if we consider the 
decision tree associated with it — where items are selected at 
nodes and the branches are determined by the selected item's 
state — then the previous equation is obtained by using linearity 
of expectation to decompose the expected benefit of it into a 
weighted sum of conditional expected marginal benefits of each 
node in the tree, where the weight of a node is the probability it 
is reached. We can then rearrange these terms, as 

/avg(TT) = E E P[$]-A(7T(*)|*) (9) 

= E P t $ ] E A(7r(*)|tf) (10) 

where A(7r(^')|5') = whenever ^ dom(7r). Rewriting 
Eq. ( [TO} using the notation and they applying Eq. (7) yields 

fc($)-i 

/avg(TT) = E P [*] E (11) 

* i=0 

fc(*)-l 

^ ^E p [ $ ] E E A (ei^,*) d2) 

Next we require the definition of policy concatenation [ 10 1 : 
Given two policies wi and 7r 2 define 7ri@7T2 as the policy ob- 
tained by running tti to completion, and then running policy 7r 2 
as if from a fresh start, ignoring the information gathered during 
the running of 7Ti. Hence under realization $ the policy 7Ti@7T2 
selects the union of items selected by 7Ti and ir 2 under $. Thus, 
£;(7ri@7r 2 , $) = £(71-1, $) U £(tt 2 , $). 

Now for each (e, <I>) such that e € -E(tt*, $) define * e) $ as 
the partial realization observed by 7r@7r* immediately before 
selecting e. We will assume without loss of generality that each 
element is selected by 7r@7r* at most once. (If this is not the case, 
we may for purposes of analysis replace the groundset E with 
E x N, order the instances in which 7r@7r* selects e arbitrarily, 
and replace the j th instance of e with (e, j). The objective 
function then simply treats all copies {(e, j) : j £ N} the same, 
as do the realizations.) Then for all e £ E* +1 ($), A (e | > 
A(e | ^e,*) by adaptive submodularity, since 9i ; <& C ^ e ^. Let 
RHS12 denote the right-hand-side of Eq. ( [T2] >. Combining the 



above fact with Eq. ( 12 ) yields 

rhs^ > -E 10 ^] E E A ( e i* e ,*) (1 3 ) 



fe($)-l 

E 



> 



^(/avg (7T@7r*)-/ avg (7r) 

^(/avg(vr*)-/ avg (7T)) 



(14) 
(15) 



To obtain Eq. (fl4j), note that {E* +1 ($) : < i < fc($) - 1} 
partitions E(tt, $), and we can decompose / avg (7r@7r* ) — 
/avg (tt) in a manner similar to how we obtained Eq. ( 1 TV Intu- 
itively, in Eq. (14) we are summing contributions over exactly 
those nodes in the decision tree corresponding to 7r@7r* which 
are there due to it*, and not over those there due to 7r. Lastly, 
Eq. (T5] ) follows from the adaptive monotonicity of /, because 
it implies / avg (ir@ir*) > / avg (7T*). For a proof of this last fact, 
see Lemma 30 of ifTOl . 

Finally, combining Eq. ( [TTj i through Eq. ( [T5| l yields / aV g(7r) > 

^ (/avg (tt* ) — /avg (tt) ^ which may be rearranged to yield the 
claimed bound of / avg (tt) > (^) / avg (vr* ). □ 



4. Application to Adaptive Match-Making 

In this section we consider an application of Theorem [7] to 
adaptive match-making problems such as online dating. We 
imagine we have an undirected graph G = (V,E) where V is 
the set of people using our match-making service, and E is a 
set of feasible pairings of people, i.e., {u, v} € E if v meets the 
requirements specified by u and vice-versa. The service will rec- 
ommend dates (corresponding to edges) to people, and after each 
date the participants will provide feedback o on the quality of 
their experience. We suppose there is a known distribution over 
how well each date {u, v} will turn out, based on the profiles of 
u and v. We also suppose that each person v has specified an 
upper bound d(v) on the number of dates desired. Subject to this 
constraint, we seek to maximize a known monotone submodular 
function / of sets of (date, feedback) pairs. For example, we 
could define a good date as one where the participants wish to 
go on another date together, let the feedback be whether the date 
was good or not, and define / as the number of users who expe- 
rienced at least one good date in the input set of (date, feedback) 
pairs. As another example for a feasible /, the feedback for a 
date {u, v} could be a pair of numeric scores corresponding to 
how satisfied u and v were with the date, respectively, and / 
could be the sum of user scores, where the score of a user u is 
the maximum score that u gave to any date. There are many 
other possibilities. 

Hence, we seek to solve Q where the items are edges, 
<&(e) encodes the feedback from each date e, and I con- 
sists of all subsets of edges whose induced subgraphs satisfy 
the degree constraints that each v has degree at most d(v). 
Formally, X := {A C E : Vu G V, deg A (v) < d(v)} where 
deg A (v) :— \{w : {v, w} £ A} |. Assuming the feedback from 
each date is an independent sample from its distribution, i.e., 
P [$] = Y\ e£E P [$(e)], then we have the following result. 



Theorem 8. Any a-approximate greedy policy it achieves 
as much reward in expectation as the optimal policy. In par- 
ticular, the greedy policy achieves a 1 / '3- approximation to the 
optimal policy. 

We spend the remainder of this section proving Theorem [8] 
We start by establishing that in our model the underlying objec- 
tive, f(A, <£>) := / ({(e, $(e)) : e £ A}), is adaptive monotone 
submodular. This follows from the following theorem, whose 
proof appears in iflOll . 

Theorem 9 (§6 of 1 10 1). Fix a monotone submodular function 
f : 2 Ex ° — > R> and a prior P[<&] with independent out- 
comes, so that P[$] = n eS B P [ $ ( e )]- Let f( A ,&) ■= 
/ ({(e, < &(e)) : e G ^4})- Then f is adaptive monotone submod- 
ular. 

Given / is adaptive monotone submodular, to prove Theo- 
rem|8]it suffices to show that the constraints in the match-making 
problem can be modeled by a p-independence system with p = 2, 
since then we can apply Theorem 7 to complete the proof. We 
now proceed to do so. 



Theorem 10. For all d : V 

system. 



N, (E,I) is a 2- independence 



Proof. If G is bipartite, with V = f/tfcl W being the bipartition 
(so that E C U x W), then this is easy to prove by establishing 
that I is the intersection of two matroids (E, Xjj) and (E, Iw)< 
where I s ■ = {A C E : Vi> £ 5, deg A (v) < d(v)} for all S € 
{U,W}. 

We prove the case of general graphs using a more direct 
approach. Recall Definition[2] and fix any C C E. Fix A and B 
to be arbitrary maximal independent subsets of C. 

For each {u, w} £ B, charge one unit to 
argmin^ G r u t(; i {d(v) — deg A (v)}. Break ties arbitrarily. 
Let c(v) be the total resulting charge to v. We claim for all v, 



c(v) < deg A (v) 



(16) 



Suppose, by way of contradiction, that c(v) > dcg A (v). Then 
some {u, v} 6 B \ A is charged to v, and so d(v) — deg A (v) < 
d(u) — deg A (u). Note c{w) < deg B (w) < d(w) for all w. 
These facts together imply 

1 < c(v) ~ deg A (v) < d(v) - deg A (v) < d(u) - deg A (u) 

Hence deg^(u) < d(u) and deg A (v) < d(v), which implies 
A U {{u, v}} £ 1, contradicting the maximality of A. 

Given Eq. ( 16), and the basic fact that J2 V deg E (v) = 2\E\ 
in any graph (V, E), we have 

\B\=Y / c(v)<Y, dc SA(v) = 2\A\ 

V V 

which completes the proof. 



5. Conclusions 

Adaptive submodularity 1 10 1 provides an elegant framework 
for analyzing myopic strategies for certain adaptive optimiza- 
tion problems, in much the same way that submodularity does 
for certain non-adaptive optimization problems. In this article 
we have shown how the greedy policy achieves near-optimal 
performance under a large parameterized class of constraints, 
the p-independence system constraints, and shown how to apply 
this result in the context of a complex adaptive match-making 
application. We believe that this and related results in [ 10 1 will 
prove useful for several other applications involving adaptive 
optimization. 
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